"Express Mail" mailing label number: ER 631 153232 US 
Date of Deposit: March 22. 2004 



Our Case No.l 1474-4 



IN THE UNITED STATES PATENT AND TRADEMARK OFFICE 
APPLICATION FOR UNITED STATES LETTERS PATENT 



INVENTOR: John Nicholas COLEMAN 

TITLE: LOGARITHMIC ARITHMETIC 

SYSTEM 



ATTORNEY: DANIEL B. SCHEIN, PH.D., ESQ. 

REGISTRATION NUMBER 33,551 

P.O. BOX 28403 
SAN JOSE, CA 95159 



(408) 294-6750 



-1- 

LOGARITHMIC ARITHMETIC SYSTEM 

This application is a continuation-in-part of co-pending U.S. Patent Application Serial No. 
09/700,353, filed November 13, 2000, which is specifically incorporated herewith as if 
5 recited below in its entirety 

The present invention relates to a logarithmic calculating apparatus and method, and 
relates particularly, but not exclusively, to a logarithmic calculating apparatus and method 
to be incorporated within microprocessors. 

10 

In electronic computational machines, it is known to represent real numbers upon which 
the machines operate using the "floating point" system, which will be famiUar to persons 
skilled in the art. Using the "floating point" system, add, subtract and multiply operations 
can be exercised at reasonable speed. However, this system suffers from the drawback that 
15 divide operations take place much more slowly, and all operations are subject to a 
maximum rounding error of half of a least significant bit. 

An alternative known approach is to represent real numbers as fixed-point logarithms, this 
approach generally being referred to as a Logarithmic Number System (LNS). Using such 
20 a system, multipUcation and division operations may be implemented very rapidly and 
with no error, but this is at the expense of considerable complexity, latency and lack of 
accuracy in the addition and subtraction processes. 

The operation of a conventional Logarithmic Number System, such as in existing 
25 arithmetic devices, is shown schematically in Figure 1, in which addition and subtraction 
operations require the evaluation of: 



30 



log2(2'+2^) = i + log2(l+2^'"); 

Iog2(2'-2^) = i + log2(l-2''0; 
where j is less than or equal to i. 
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The logarithmic function F(r = Q -1)) = log2(l+2^'^) for the add operation is shown in 
Figure 2. The device shown in Figure 1 is provided with look up tables which store the 
value of the function F(r) for a range of discrete values of r. For practical applications, it 
5 is not possible to store the function for all values of r, as this would require a look up table 
of impracticable size, and so the value of the function is therefore stored at intervals of 
width A. To further minimise the storage requirements, A is progressively increased as the 
function becomes more linear with decreasing r, and an intervening value of r lying in the 
nth interval is thus expressed as: 

10 

r=-5 {n=0} ; r=E(-4i) - 6 {n > 0} 

For clarity, however, this description omits any further reference to the variation in A, and 
the expression may be abbreviated to r = -nA - 5. 

15 

Alongside each value of the function F(r) is stored its derivative, or slope at that point, 
D(r). The function for any intermediate value of r is then obtained by interpolation with a 
first-order Taylor-series: 

20 F(-nA-6) = F(-nA)-5D(-nA) 

Figure 3 shows in detail the function F(r) within one interval of width A. The error € 
between the true value of the Function F(r) at position 8 and that interpolated from the 
derivative at an adjacent stored position is shown. Referring to Figure 1, the apparatus has 

25 a selector/subtracter 1 for deriving the values of i and r=j-i, a look up table 2 containing the 
value of the function F(r) at discrete values of r, and a look up table 3 containing values of 
the derivative D(r) at discrete values of r. A multiply stage 4 multiplies the derivative D(r) 
by 5 and a carry-save-add stage 5 adds together the outputs of the selector/subtracter 1, 
look up table 2 and multiplier 4. The output of the carry-save-add stage 5 is then fed to a 

30 carry-propagate-add stage 6. 
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It can therefore be seen that following the initial subtraction in selector/subtracter 1 to 
obtain r, r is partitioned into lower and higher order segments, so effectively dividing by A. 
The higher order segment represents n and is used to access the function F(r) in look up 
5 table 2 and derivative D(r) in look up table 3, whilst the lower order segment represents 6. 
F(-nA) stored in look up table 2 is then added in the carry-save-add stage 5 to the product 
6.D(-nA) obtained from the multiplier 4, so as to obtain the approximation to F(r) at the 
value of r of interest, to which is also added i, the addition being completed in the final 
carry-propagate-add stage 6, so as to yield the result. 

10 

As can be seen from Figure 3, the prior art arrangement described with reference to Figures 
1 to 3 includes an interpolation error: 

€(n,6) = F(-nA) - 5.D(-nA) - F(-nA-6) 

15 

and for each n, e increases with 5 to a maximum: 

E(n) = F(-nA) - AD(-nA) - F(-nA - A) approximately. 

20 In any practical implementation, the error e must be minimised to an acceptable 

magnitude. This can be achieved by decreasing the increment A, but at the cost of a 
considerable increase in the required capacity of the look up tables. Alternatively, 
methods are known which apply correction algorithms to minimise the error without 
increasing the necessary capacity of the look up tables. However, these all suffer from the 

25 drawback that they involve many extra stages in the computation process, which are 
executed serially, and therefore introduce speed limitations. 

Preferred embodiments of the present invention seek to overcome the above disadvantages 
of the prior art. 

30 
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According to an aspect of the present invention, there is provided a logarithmic calculating 
apparatus for determining the approximate value of a logarithmic function F(x) at a value 
of X of interest, the apparatus comprising: 

5 first memory means for storing values of the function F(x) for a plurality of discrete values 
ofx; 

second memory means for storing values of F^x), the slope of the function F(x), for said 
plurality of discrete values ofx; 

10 

first multiplier means, or equivalent thereof, for multiplying the value of F*(x) for a said 
discrete value of x adjacent to the value ofx of interest by 8, the difference between said 
adjacent discrete value ofx and the value of x of interest; 

15 third memory means for storing values of E(x), the approximate difference between [F(x) 
+ A.F'(x)] and F(x+A) where x and x+A are the adjacent pair of said discrete values of x 
nearest to the value of x of interest; 

fourth memory means for storing values of P(5), the ratio of (i) the difference between 
20 F(x+5) and [F(x) + 5.F'(x)], to (ii) E(x) for a plurality of values of 6; 

second multiplier means, or equivalent thereof, for multiplying together the outputs of said 
third and fourth memory means; and 

25 , adder means for adding together the outputs of said first memory means, said first 

multipUer means and said second multipUer means to produce an output representing the 
approximate value of F(x) at the value of x of interest. 
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The present invention is based on the surprising observation that for many logarithmic 
functions, the ratio function P(5) for a given value of 8 is almost, although not exactly, 
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constant for all values of n. This provides the advantage that it is possible to store for a 
single value of n a look up table containing the values of P(6) at successive points 
throughout an interval of width A. This in turn provides the advantage of enabling the 
logarithmic function F(x) to be calculated accurately for a value of x of interest without 
5 significantly increasing the necessary size of look up tables in the apparatus, and without 
significantly decreasing the speed of calculation. For example, the necessary size of look 
up tables may be doubled compared with the prior art, which does not cause significant 
problems. It will also be appreciated by persons skilled in the art that the quantities x, 
F(x), FXx),6, E(x) and A defined above can be positive or negative. 

10 

In a preferred embodiment, said first and second multiplier means operate substantially 
simultaneously in use. 

This provides the advantage of not decreasing the speed at which the calculation process 
1 5 can be carried out. 

The apparatus may fiirther comprise fiulher adder means. 

The or each adder means may be a carry-save-add means cooperating with at least one 
20 carry-propagate add means. 

This provides the advantage of minimising the extent to which fiuther adder stages slow 
down the calculation process. 

25 In a preferred embodiment, said first, second, third and fourth memory means are accessed 
substantially simultaneously in use. This provides the advantage of not decreasing the 
speed of the calculation process. 
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According to another aspect of the invention, there is provided a microprocessor including 
a logarithmic calculating apparatus as defined above. 
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According to a further aspect of the present invention, there is provided a method of 
determining the approximate value of a logarithmic function F(x) at a value of x of interest 
in a logarithmic calculating apparatus, the method comprising the steps of: 

5 

storing values of the function F(x) for a plurality of discrete values of x; 

storing values of F^x), the slope of the function F(x), for said plurality of discrete values of 

10 

muhiplying the value of F^x) for a said discrete value of x adjacent to the value of x of 
interest by 6, the difference between said adjacent discrete value of x and the value of x of 
interest; 

15 storing values of E(x), the approximate difference between [F(x) + A.FXx)] and F(x+A) 

where x and x+A are the adjacent pair of said discrete values of x nearest to the value of x 
of interest; 

storing values of P(5), the ratio of (i) the difference between [F(x) +6.FXx)] and F(x+8), to 
20 (ii) E(x) for a plurality of values of 5; 

multiplying together the values of E(x) and P(5) for the value of x of interest; and 

adding together said values of F(x), 6.F'(x) and E(x).P(8) to provide the approximate value 
25 ofF(x)forthe value of X of interest. 

In a preferred embodiment, said multiplication steps are carried out substantially 
simultaneously. 
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Said addition step may be carried out by means of a carry-save-add stage cooperating with 
a subsequent carry-propagate-add stage. 

The stored values are preferably accessed substantially simultaneously. 

5 

This provides the advantage of not decreasing the speed of the calculation process. 

A preferred embodiment of the invention will now be described, by way of example only 
and not in any limitative sense, with reference to the accompanying drawings, in which:- 

10 

Figure 1 is a schematic representation of a prior art Logarithmic Number System; 

Figure 2 is a graph showing the variation of the logarithmic function F(r) = log2(l+2^"^), 
where r = j-i, with r; 

15 

Figure 3 is a diagram showing the relationship between the quantities €, E(r), 8 and A for a 
first embodiment of the present invention; 

Figure 4 is a schematic representation of a logarithmic arithmetic apparatus for 
20 implementing the embodiment of Figure 3; and 

Figure 5 is a diagram showing the relationship between the quantities €, E(r), 8 and A for a 
second embodiment of the present invention. 

25 Referring to Figure 4, a logarithmic arithmetic apparatus embodied within a 

microprocessor has a selector/subtracter 101 having an output 102 representing i and an 
output 103 representing r=i-i. The output 103 of selector/subtracter 101 is input to a look 
up table 104 storing values of the function F(r) at discrete values of r, a look up table 105 

storing values of D(r), the derivative or slope of F(r) at discrete values of r, a look up table 
30 106 storing values of E(r), the difference between [F(r) + A.D(r)] and F(rH-A) for each 
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interval of width A, and a look up table 107 for storing values of P(5), the ratio of the 
difference between F(r+6) and [F(r) + 6 .D(r)] to E(r) for a plurality of values of 6. 



The output 103 of selector/subtracter 101 is thus divided into a high order segment 
5 representing n and a low order segment representing 5, the high order segment being 

input to the look up tables 104, 105 and 106, and the low order segment being input to look 
up table 107. The low order segment is also input to a multiplier 108 together with the 
output of look up table 105 to produce ah output representing the product of 6 and the 
derivative D(r). The outputs of look up tables 106 and 107 are input to a multiplier 109 
10 which multiplies those quantities together to provide an output representing the error in the 
interpolated value of the function. 

The outputs of look up table 104 and multipliers 108 and 109 are input to a carry-save-add 
stage 1 10, the two outputs of which are input to a further carry-save-add stage 1 1 1, to 
15 which the output 102 of selector/subtracter 101 is also input. The two outputs of carry- 
save-add stage 111 are then input to a carry-propagate-add stage 112, the output of which 
represents the value of the overall logarithmic function i + log2(l+2^'^) to be calculated. 

The operation of the apparatus shown in Figure 4 will now be described. 

20 

The values j and i are input to the selector/subtracter 101, as a result of which the output 
103 of subtracter 101 causes simultaneous access to look up tables 104, 105, 106 and 107. 
The multiplier 108 then determines the product 5.D(r), while the multiplier 109 operates 
simultaneously to determine the product E(r).P(6). The outputs of look up table 104 and 
25 multiphers 108, 109 are then added in carry-save-add stage 1 10 to determine the value of 
the logarithmic function F(r). The remaining part i of the quantity to be evaluated is 
added to the value of F(r) in carry-save-add stage 111 and input to the carry-propagate-add 
stage 112, which outputs the result to be determined. 



« 



-9- 

The carry-save-add stages 110 and 1 1 1 do not significantly decrease the speed of 
calculation by the device, and decrease in the speed of calculation is avoided by 
simultaneous operation of look up tables 104, 105, 106 and 107, and simultaneous 
operation of multipliers 108 and 109. In a practical embodiment, the P look up table 107 
S is not implemented for all possible values of d but instead is implemented at intervals 
throughout its range. In one practical embodiment of the invention, based on a forty bit 
Logarithmic Number System using a 4096 word P look up table 107, appUcation of the 
invention improves the worst case error from around five thousand times the least 
significant bit, to around four times the least significant bit. In this case, the look up table 

10 capacity occupied by the new E look up table 106 and P look up table 107 is approximately 
the same as that akeady occupied by the F look up table 104 and D look up table 105. 
Thus the total table capacity is no more than doubled. With appropriate rounding, this 
forty bit system can be used to perform the intemal operations of a 32 bit system, and 
when used in this way, worst case accuracy approaching 0.5 of a least significant bit is 

15 achievable. 

Referring now to Figure 5, which utihses the same hardware as the embodiment of Figure 
4, in a second embodiment of the present invention, the linear fiinction G(x) is a tangent to 
the curve F(x) at a point between two adjacent values of x. As in the embodiment of 

20 Figures 3 and 4, look up table 104 holds a plurality of discrete stored values of the fimction 
F of interest. The value of F(x) at any point 8 within the interval is obtained by adding the 
adjacent stored value of F(x) to the product 6D(5) where D represents a slope and is stored 
along with each stored value of F. The error s incurred by this approximation is corrected 
by multiplying the maximum error in this interval, E, by the proportion, P, of this 

25 maximum accruing around the point 5, this error then being added back into the final 
result. 

However the look up table 105 of the embodiment of Figure 5 differs from that of the 
embodiment of Figures 3 and 4 now contains the values of the slope, F^x), of the curve at 
30 a point lying anywhere in the interval defined by the two stored values adjacent to the 
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value X of interest. Depending on the point at which the slope was calculated, E may now 
occur at either end of the interval. This has the advantage of decreasing the maximum 
error, E compared with the embodiment of Figures 3 and 4. 

5 It will be appreciated by persons skilled in the art that the above embodiment has been 
described by way of example only, and not in any limitative sense, and that various 
alterations and modifications are possible without departure jfrom the scope of the 
invention as defined by the appended claims. For example, the invention can be used to 
determine many types of logarithmic function (i.e. a function having a logarithmic term) 

10 and not just the specific addition example set out above. Also, the interval A can be of 

constant or variable width over the intervals for which values are stored, with appropriate 
treatment of the input address to the F, D, E and P look up tables. The method of the 
invention can also be used for the logarithmic subtraction function set out on page 1, with 
the possible exception of the range r = 0 to r = -1. However, methods will be familiar to 

15 persons skilled in the art, for example J.N. Coleman "SimpUfication of Table Structure in 
Logarithmic Arithmetic", Electronics Letters, Vol. 31, 1995, pages 1905 to 1906, and the 
erratum Vol. 32, 1996, page 2103, the entire disclosures of which are incorporated herein 
by reference. By these methods, subtraction in this region may be avoided. It will also be 
appreciated by persons skilled in the art, as mentioned above, that the quantities E(x), P(5), 

20 X and so on can be positive or negative as appropriate, and that adder stages may 
correspondingly be adder or subtracter as appropriate. 

It will be further appreciated by persons skilled in the art that carry-propagate-add stages 
occurring within multipliers 108 and 109 could be replaced with additional carry-save-add 
25 stages between the output of the now reduced multipliers (previously indicated 108 and 
109) and the final carry-propagate-add 1 12, to produce the same result. This is just one 
(non-limiting) example of what is meant by the phrase "or equivalent thereof in relation to 
the first and/or second multiplier means of the invention specified herein. 



Non-limiting examples of devices for implementing the invention are a general purpose 
microprocessor, a numerical microprocessor, a graphics processor, or a digital signal 
processor, and exemplary uses for the invention envisioned include it use in graphics 
accelerator boards, video games, communications equipment, computer controlled 
equipment, radar and sonar apparatus, and general purpose numerical processing 
equipment. 



